Aerial Suspended Cargo Delivery through Reinforcement Learning
نویسندگان
چکیده
Cargo-bearing Unmanned aerial vehicles (UAVs) have tremendous potential to assist humans in food, medicine, and supply deliveries. For time-critical cargo delivery tasks, UAVs need to be able to navigate their environments and deliver suspended payloads with bounded load displacement. As a constraint balancing task for joint UAV-suspended load system dynamics, this task poses a challenge. This article presents a reinforcement learning approach to aerial cargo delivery tasks in environments with static obstacles. We first learn a minimal residual oscillations task policy in obstacle-free environments that find trajectories with minimized residual load displacement with a specifically designed feature vector for value function approximation. With insights of learning from the cargo delivery problem, we define a set of formal criteria for class of robotics problems where learning can occur in a simplified problem space and transfer to a broader problem space. Exploiting this property, we create a path tracking method that suppresses load displacement. As an extension to tasks in environments with static obstacles where the load displacement needs to be bounded throughout the trajectory, sampling-based motion planning generates collision-free paths. Next, a reinforcement learning agent transforms these paths into trajectories that maintain the bound on the load displacement while following the collision-free path in a timely manner. We verify the approach both in simulation and in experiments on quadrotor with suspended load.
منابع مشابه
PRM-RL: Long-range Robotic Navigation Tasks by Combining Reinforcement Learning and Sampling-based Planning
We present PRM-RL, a hierarchical method for long-range navigation task completion that combines samplingbased path planning with reinforcement learning (RL) agents. The RL agents learn short-range, point-to-point navigation policies that capture robot dynamics and task constraints without knowledge of the large-scale topology, while the sampling-based planners provide an approximate map of the...
متن کاملLearning Swing-free Trajectories for UAVs with a Suspended Load in Obstacle-free Environments
Attaining autonomous flight is an important task in aerial robotics. Often flight trajectories are not only subject to unknown system dynamics, but also to specific task constraints. We are interested in producing a trajectory for an aerial robot with a suspended load that delivers the load to a destination in a swing-free fashion. This paper presents a motion planning framework for generating ...
متن کاملAutonomous UAV Navigation Using Reinforcement Learning
Unmanned aerial vehicles (UAV) are commonly used for missions in unknown environments, where an exact mathematical model of the environment may not be available. This paper provides a framework for using reinforcement learning to allow the UAV to navigate successfully in such environments. We conducted our simulation and real implementation to show how the UAVs can successfully learn to navigat...
متن کاملFuzzy Adaptive Control of Unmanned Aerial Vehicle for Carrying Time-Varying Cargo on Predefined Path
At present, the use of unmanned aerial vehicles (UAVs) has been increased dramatically. The reasons for this development are cheapness, smallness, simplicity, and diversity of missions. The simplicity of guidance and control of multi-rotor drones is that they are equipped with an autopilot system. This system is responsible for flying control. UAVs do not have a high weight and often have three...
متن کاملTime-Varying Formation Controllers for Unmanned Aerial Vehicles Using Deep Reinforcement Learning
We consider the problem of designing scalable and portable controllers for unmanned aerial vehicles (UAVs) to reach time-varying formations as quickly as possible. This brief confirms that deep reinforcement learning can be used in a multi-agent fashion to drive UAVs to reach any formation while taking into account optimality and portability. We use a deep neural network to estimate how good a ...
متن کامل